Mutual Information-Based Variable Selection on Latent Class Cluster Analysis
نویسندگان
چکیده
Machine learning techniques are becoming indispensable tools for extracting useful information. Among many machine techniques, variable selection is a solution used converting high-dimensional data into simpler while still preserving the characteristics of original data. Variable aims to find best subset variables that produce smallest generalization error; it can also reduce computational complexity, storage, and costs. The method developed in this paper was part latent class cluster (LCC) analysis—i.e., not pre-processing step but, instead, formed LCC analysis. Many studies have shown analysis suffers from problems has difficulty meeting local dependency assumptions—therefore, study, we selecting using mutual information (MI) Mutual symmetrical measure carried by two random variables. proposed applied MI-based analysis, and, as result, four were selected use LCC-based village clustering.
منابع مشابه
Latent Class Analysis Variable Selection.
We propose a method for selecting variables in latent class analysis, which is the most common model-based clustering method for discrete data. The method assesses a variable's usefulness for clustering by comparing two models, given the clustering variables already selected. In one model the variable contributes information about cluster allocation beyond that contained in the already selected...
متن کاملBayesian variable selection for latent class models.
In this article, we develop a latent class model with class probabilities that depend on subject-specific covariates. One of our major goals is to identify important predictors of latent classes. We consider methodology that allows estimation of latent classes while allowing for variable selection uncertainty. We propose a Bayesian variable selection approach and implement a stochastic search G...
متن کاملFeature Selection Based on Joint Mutual Information
A feature/input selection method is proposed based on joint mutual information. The new method is better than the existing methods based on mutual information in eliminating redundancy in the inputs. It is applied in a real world application to nd 2-D viewing coordinates for data visualization and to select inputs for a neural network classiier. The result shows that the new method can nd many ...
متن کاملOn Classification of Bivariate Distributions Based on Mutual Information
Among all measures of independence between random variables, mutual information is the only one that is based on information theory. Mutual information takes into account of all kinds of dependencies between variables, i.e., both the linear and non-linear dependencies. In this paper we have classified some well-known bivariate distributions into two classes of distributions based on their mutua...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Symmetry
سال: 2022
ISSN: ['0865-4824', '2226-1877']
DOI: https://doi.org/10.3390/sym14050908